Abstract

Although small populations are expected to lose genetic diversity through 
genetic drift and inbreeding, a number of mechanisms exist that could mini- 
mize this genetic decline. Examples include mate choice for unrelated mates 
and fertilization patterns biased toward genetically dissimilar gametes. Both 
processes have been widely documented, but the long-term implications have 
received little attention. Here, we combined over 25 years of field data with 
high-resolution genetic data to assess the long-term impacts of biased fertiliza- 
tion patterns in the endangered North Atlantic right whale. Offspring have 
higher levels of microsatellite heterozygosity than expected from this gene pool 
(effect size = 0.326, P < 0.011). This pattern is not due to precopulatory mate 
choice for genetically dissimilar mates (P < 0.600), but instead results from 
postcopulatory selection for gametes that are genetically dissimilar (effect 
size = 0.37, P < 0.003). The long-term implication is that heterozygosity has 
slowly increased in calves born throughout the study period, as opposed to the 
slight decline that was expected. Therefore, this mechanism represents a natural 
means through which small populations can mitigate the loss of genetic diver- 
sity over time. 
Introduction

Randomly mating closed populations are expected to 
lose genetic diversity over time through genetic drift and 
inbreeding (Fisher 1930; Wright 1931; Hartl and Clark 
1997). Specifically, alleles can be lost and heterozygosity 
can decrease due to drift, whereas inbreeding reduces 
heterozygosity. However, genetic diversity plays a key 
role in shaping individual fitness (Coltman et al. 1999; 
Dunn and Byers 2008), and therefore it is not surprising 
that a variety of mechanisms have evolved that can result
in patterns of reproduction deviating from random
expectations in ways that increase the genetic diversity 
(and subsequent fitness) of offspring (see Jordan and 
Bruford 1998; Tregenza and Wedell 2000; and Kempena- 
ers 2007 for reviews). If such mechanisms are widespread 
in a population, then these individual-based strategies 
can reduce the rate at which genetic diversity is lost 
from the population as a whole, and thus impact the 
overall extinction probability of the population (Saccheri 
et al. 1998; Westemeier et al. 1998; Spielman et al. 
2004). 
Examples of such mechanisms include both pre- and
postcopulatory strategies. Precopulatory strategies (gener- 
ally referred to as "mate choice") for reducing the loss of 
genetic diversity typically involve mechanisms to avoid 
inbreeding, such as natal dispersal (Pusey and Wolf 1996; 
Szulkin and Sheldon 2008), active avoidance of mating with 
relatives (Pusey and Wolf 1996; Stow and Sunnucks 2004), 
or the preference of mates that are genetically dissimilar 
(Wedekind et al. 1995; Hoffman et al. 2007). Two primary 
postcopulatory mechanisms have been described, both of
which have been referred to as "genetic incompatibility."
The first involves increased rates of fetal loss when offspring 
are too genetically similar to the mother, which results in a 
breakdown in self-/non-self-recognition and subsequent 
fetal abortion (Ober et al. 1998). The second mechanism 
involves increased fertilization/pregnancy success between 
genetically dissimilar gametes (Birkhead et al. 2004; Evans 
and Marshall 2005; Dziminski et al. 2008), which presum- 
ably results from selection for heterozygous offspring 
(Brown 1997). For species with internal fertilization, it is 
generally not known if this results from prefertilization 
processes (e.g., cryptic sperm choice by females) or from 
differential mortality of zygotes (Olsson et al. 1999). 

Although these mechanisms have been documented in 
a wide range of species, the studies have focused on 
short-term questions (e.g., whether or not the mecha- 
nisms are acting), and few data are available regarding 
the subsequent long-term/evolutionary implications. As a 
result, future progress in this field will rely on longer 
term studies of wild populations to identify the long-term 
implications of these processes in nature (Birkhead and 
Pizzari 2002; Mays and Hill 2004). 

Conceptually, it seems likely that the impacts of these 
processes will become larger as population size decreases. 
In large populations there will be a large number of mates 
and gametes to choose from. Thus, most mates or 
gametes chosen at random will meet the criteria of unrelated,
dissimilar, and/or compatible (whichever mechanism(s)
is acting in the particular population). However,
as population size declines, the number of mates or 
gametes that meet the criteria will also decline, resulting 
in mate choice/compatibility having an increasing impact 
on patterns of reproductive success. Consequently, the 
ability to detect and study such patterns should be higher 
for small populations. Here, we use over 25 years of data 
from the North Atlantic right whale (Eubalaena glacialis) 
to test hypotheses regarding the impact of mate choice 
and genetic incompatibility on patterns of reproductive 
success in this endangered species. 

The North Atlantic right whale (Fig. 1) is one of the 
world's most endangered large whales, with 400-500 indi- 
viduals remaining (Hamilton et al. 2007; Pettis 2012). 
Despite over 75 years of international protection, there
has been little sign of recovery (Kraus et al. 2005). The
two most likely factors limiting population recovery are a
high rate of anthropogenic mortality due to ship strikes
and entanglement in fishing gear (Kraus et al. 2005), and
a reproductive rate that is substantially lower than their
known potential (Frasier et al. 2007a; Kraus et al. 2007;
Browning et al. 2010).

Figure 1. A mother and calf North Atlantic right whale (Eubalaena
glacialis). Photo taken by Jessica Taylor/New England Aquarium under
NOAA permit #655-1652-01.

North Atlantic right whales have among the lowest levels 
of genetic variability yet reported for a wild population 
(Frasier et al. 2007b), which has led to the hypothesis that 
low genetic diversity is at least partially responsible for the 
reduced reproductive performance. Preliminary support 
for this hypothesis came from restriction fragment length 
polymorphism (RFLP) studies, which found that band 
sharing between mothers and offspring was significantly 
lower than expected (Schaeff et al. 1997). This result 
suggested that mating events between genetically similar 
individuals are less successful, whereas those between 
genetically dissimilar individuals are more likely to result
in successful fertilizations and pregnancies. Here, we com- 
bined data from 28 microsatellite loci with data from the 
major histocompatibility complex (MHC) to extend these 
analyses and test hypotheses regarding the influence of 
genetic variability on patterns of reproductive success. Specifically,
we tested three major hypotheses for each type of
marker (microsatellites and the MHC): (1) offspring have 
higher levels of genetic variability than expected from ran- 
dom mating/fertilization within this gene pool; (2) individ- 
uals show precopulatory mate choice for genetically 
dissimilar mates; and (3) pregnancies or fertilizations are 
more successful between genetically dissimilar gametes, or 
when the gametes result in a fetal genotype dissimilar from 
that of the mother. 
Methods

Photo-identification and maternity data 

Individual North Atlantic right whales are recognized 
using natural markings: primarily callosity patterns on 
their heads, as well as variation in ventral pigmentation 
and scarring patterns (Kraus et al. 1986; Hamilton et al.
2007). Photo-identification research has been ongoing,
almost year round, since 1980. Calves nurse for ~1 year 
(Hamilton et al. 1995), and therefore mothers are identi- 
fied by this long-term association with their calves 
(Knowlton et al. 1994). Given the extensive nature of the 
surveys on both the calving grounds and in spring, sum- 
mer, and fall nursery and feeding areas, most calving 
events are identified each year (Browning et al. 2010). 

Genetic and paternity data 

Small skin samples have been collected from right whales, 
in association with the photo-identifications, since 1988 
using a crossbow equipped with a modified bolt and 
biopsy dart (Brown et al. 1991). DNA is extracted from 
each sample using standard protocols (e.g., Wang et al.
2008). Individual-specific genetic profiles are obtained for
each sample based on molecular sex determination (Shaw 
et al. 2003), sequencing a portion of the mitochondrial 
control region (Malik et al. 1999), and genotype analysis 
at 35 microsatellite loci (Frasier et al. 2006). Although all
samples are typed at 35 loci, 28 were used for subsequent 
analyses, with two loci being removed due to high esti- 
mates of null alleles and five being removed due to signif- 
icant signs of linkage (Frasier et al. 2007a). These 28 loci 
provide very high resolution for discriminating between 
individuals and assessing relatedness. For example, the 
probability of identity, $P_{(ID)}$ (Paetkau and Strobeck 1994),
is $5.92 \times 10^{-11}$, and the $P_{(ID)sib}$ (Waits et al. 2001) is
$3.61 \times 10^{-5}$. A comparison of the genetic and photo-
identification data showed that error rates associated with 
both methods are very low (0.0308 errors/identification 
for the photo-ID data, and 0.00121 errors/locus for the 
genetic data), indicating that both provide reliable means 
of individualization for this species (Frasier et al. 2009). 
Previous analyses based on these data showed that with 
this large number of loci, different paternity assignment 
methods converge on similar assignments (Frasier et al. 
2007a). As a result, paternity is assigned on an annual 
basis using the method of exclusion, where paternity is 
only assigned when all but one male are excluded as a
potential father, with every excluded male mismatching at a minimum
of two loci. These paternity assignments are stored
in the genetic database. Two mismatching loci were 
chosen as the criterion for exclusion to prevent false
exclusions from potentially being made as the result of
mutations or genotyping errors. Based on available data 
on microsatellite mutation rates (e.g., Ellegren 2004), and 
genotyping error rates in this study (Frasier et al. 2009), 
the probability of two such events in one mother-father-offspring
triad is exceedingly low. Based on this same logic,
two mismatching loci are also the criterion for exclusion 
in human forensic paternity cases (e.g., Butler 2005), and 
therefore this criterion also seemed appropriate for our 
study. Frasier et al. (2007a) reported paternity assign- 
ments from 1980 to 2001, and here we have added data 
from years 2002-2006, as well as a reanalysis of all previ- 
ous years to account for more potential fathers that have 
been sampled. 
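
The exclusion rule described above can be sketched in code. This is a minimal illustration rather than the procedure actually used for the genetic database: the genotype representation (a dict mapping a locus name to a pair of alleles), the function names, and the reading that a male with fewer than two mismatching loci remains nonexcluded are all assumptions made here.

```python
from typing import Dict, Optional, Tuple

# Hypothetical representation: locus name -> (allele, allele)
Genotype = Dict[str, Tuple[str, str]]


def count_mismatches(mother: Genotype, calf: Genotype, male: Genotype) -> int:
    """Count loci at which `male` cannot be the father of `calf`, given `mother`."""
    mismatches = 0
    for locus, (c1, c2) in calf.items():
        if locus not in mother or locus not in male:
            continue  # skip loci not typed in all three individuals
        m_alleles, f_alleles = set(mother[locus]), set(male[locus])
        # Compatible if one calf allele can come from the mother
        # and the other from the candidate male.
        compatible = (c1 in m_alleles and c2 in f_alleles) or (
            c2 in m_alleles and c1 in f_alleles
        )
        if not compatible:
            mismatches += 1
    return mismatches


def assign_by_exclusion(
    mother: Genotype,
    calf: Genotype,
    males: Dict[str, Genotype],
    min_mismatch: int = 2,
) -> Optional[str]:
    """Return the single nonexcluded male, or None if zero or several remain.

    A male is treated as excluded once he mismatches at `min_mismatch` loci.
    """
    counts = {name: count_mismatches(mother, calf, g) for name, g in males.items()}
    nonexcluded = [name for name, k in counts.items() if k < min_mismatch]
    return nonexcluded[0] if len(nonexcluded) == 1 else None
```

Under this reading, a calf with two or more nonexcluded males stays unassigned, as does a calf for which every sampled male is excluded.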

Although the standard method used for paternity 
assignment in this species is based on exclusion, in this 
study we also used the 95% criterion in CERVUS (Marshall
et al. 1998). Specifically, paternity analyses were con- 
ducted independently for each year, to account for the 
maturation of calves from earlier years to parents in later 
years, and changes in the pool of candidate males each 
year. Critical values of Δ (the ratio of the likelihood of
the two most likely males as fathers) for the 95% confi- 
dence criterion were estimated based on 10,000 simula- 
tion cycles and allowing a genotyping error rate of 0.010. 

All subsequent analyses were conducted using each of 
these data sets (i.e., once using the paternities assigned 
using exclusion, and a second time using the paternities 
assigned using the 95% criterion of CERVUS). Because all
subsequent analyses (and interpretation) are based on the 
paternity assignments, this approach ensures that the 
results obtained were robust to different strategies of 
paternity assignment. 

It has recently been recognized that paternity assignment 
methods can produce biased results, where assigned
fathers may be biased toward males that are more/less het- 
erozygous, or more/less genetically similar to the mother, 
depending on the assignment method used and the charac- 
teristics of the loci (Wetzel and Westneat 2009; Wang 
2010). The recommended solution to this problem is to use 
one set of markers for paternity assignment, and a different 
set of markers for subsequent analyses. Preliminary tests of 
this approach indicated that this was not an option with 
our data set due to the very low levels of genetic variability 
in this species; hence, all loci were needed for confidence 
in paternity assignment. Although this recommended 
approach could not be taken, there are other means to 
assess whether or not the results obtained have been biased 
in this manner. We have taken steps to ensure that our 
results are not biased by these paternity assignment issues, 
and these are discussed in greater detail in the Discussion. 

For this study, a 240 bp fragment of exon 2 of the class
II DRβ major histocompatibility complex (MHC) region of all
mother-father-offspring triads was also analyzed to aid
our assessment of reproductive patterns (Gillett 2009). 
This region was chosen because it is known to influence 
patterns of precopulatory mate choice, as well as postcop- 
ulatory fertilization patterns and zygote survival in many 
vertebrates (Wedekind et al. 1995; Ober et al. 1998; Jacob 
et al. 2002; Milinski 2006). 

Statistical analyses 

Genetic variation in offspring was quantified using two 
metrics, internal relatedness (IR, Amos et al. 2001) and 
homozygosity by loci (HL, Aparicio et al. 2006). Both of 
these metrics are commonly used to quantify the genetic 
diversity of individuals (e.g., Rijks et al. 2008) and are 
essentially weighted measures of heterozygosity, where the 
weights are based on the frequencies of the two alleles 
present (IR), or the allelic variability in each locus (HL). 
Simulations, generated using the program STORM v. 2.0 
(Frasier 2008), were used to test if offspring have higher 
levels of heterozygosity than expected given the gene pool, 
as was observed with the RFLP data. Briefly, whales that 
had been assigned as parents represented the available 
gene pool. Simulated mating pairs were generated by 
sampling only these individuals, with replacement, to gen- 
erate the same number of mating pairs as for the 
observed data. One offspring for each pair was then gen- 
erated based on Mendelian inheritance, and IR and HL 
were calculated for each offspring. This process was 
repeated 1000 times to generate the expected distributions 
of these metrics for this gene pool. Calculations of effect 
size for all comparisons between observed and expected 
values were based on Cohen's d, $d = (\mu_1 - \mu_2)/\sqrt{(\sigma_1^2 + \sigma_2^2)/2}$
(Cohen 1988), where $\mu$ and $\sigma$ represent the mean
and standard deviation, respectively.
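
The simulation and effect-size calculation described in this paragraph can be sketched as follows. This is an illustration only and is not the STORM implementation: the genotype representation (dict of locus to allele pair), the function names, and the choice to estimate per-locus expected heterozygosity from the pooled parents are assumptions. HL is computed as the summed expected heterozygosity of the loci at which an individual is homozygous, divided by the summed expected heterozygosity of all its typed loci.

```python
import math
import random
from statistics import mean, stdev


def expected_heterozygosity(pool, locus):
    """Expected heterozygosity, 1 - sum(p_i^2), from allele frequencies in the pool."""
    alleles = [allele for genotype in pool for allele in genotype[locus]]
    total = len(alleles)
    return 1.0 - sum((alleles.count(a) / total) ** 2 for a in set(alleles))


def homozygosity_by_loci(genotype, locus_het):
    """HL: weight of homozygous loci over weight of all typed loci."""
    hom = sum(locus_het[l] for l, (a, b) in genotype.items() if a == b)
    het = sum(locus_het[l] for l, (a, b) in genotype.items() if a != b)
    return hom / (hom + het) if (hom + het) > 0 else 0.0


def mendelian_offspring(mother, father, rng):
    """Draw one allele per locus at random from each parent."""
    return {l: (rng.choice(mother[l]), rng.choice(father[l])) for l in mother}


def expected_mean_hl(mothers, fathers, n_pairs, n_reps, rng):
    """Mean offspring HL for each of n_reps simulated sets of random mating pairs."""
    pool = mothers + fathers
    locus_het = {l: expected_heterozygosity(pool, l) for l in pool[0]}
    means = []
    for _ in range(n_reps):
        values = []
        for _ in range(n_pairs):
            # Parents are sampled with replacement from the assigned parents.
            mom, dad = rng.choice(mothers), rng.choice(fathers)
            child = mendelian_offspring(mom, dad, rng)
            values.append(homozygosity_by_loci(child, locus_het))
        means.append(mean(values))
    return means


def cohens_d(x, y):
    """Cohen's d using the root mean square of the two standard deviations."""
    return (mean(x) - mean(y)) / math.sqrt((stdev(x) ** 2 + stdev(y) ** 2) / 2)
```

An empirical P-value then follows as the fraction of simulated means that are as low as or lower than the observed mean HL.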

An observed excess of heterozygosity in offspring could 
be due to three possible processes: (1) precopulatory mate 
choice for genetically dissimilar (or unrelated) mates; (2) 
genetic incompatibility based on fetal loss resulting from 
a breakdown in self-/non-self-recognition; and (3) genetic 
incompatibility based on heterozygosity. 

Randomization processes were used to test if individu- 
als show mate choice preferences for genetically dissimilar 
mates, where relatedness was used as a proxy for genetic 
similarity. The relatedness of observed mating pairs was 
calculated using the method of Li et al. (1993), with each 
locus weighted as described in Lynch and Ritland (1999). 
Then, individuals were randomly shuffled to create a new 
set of observed mating pairs, and the relatedness values 
for these randomized pairs were calculated. This shuffling 
process was repeated 1000 times to generate the distribu- 
tion of expected relatedness values if individuals do not 
show a preference for dissimilar mates. 
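
A minimal sketch of this shuffling procedure is given below. The similarity score used here (a simple allele-sharing proportion) is a stand-in chosen for brevity; it is not the Li et al. (1993) relatedness estimator or the Lynch and Ritland (1999) locus weighting, and the function names and genotype representation are hypothetical.

```python
import random
from statistics import mean


def allele_sharing(g1, g2):
    """Stand-in similarity score in [0, 1]; NOT the Li et al. (1993) estimator."""
    scores = []
    for locus in g1:
        a, b = g1[locus], g2[locus]
        shared = sum(x in b for x in a) + sum(x in a for x in b)
        scores.append(shared / 4)
    return mean(scores)


def mate_choice_permutation_test(females, males, n_perm=1000, seed=0,
                                 similarity=allele_sharing):
    """females[i] and males[i] form observed mating pair i.

    Returns (observed mean similarity, fraction of shuffles as low or lower).
    """
    rng = random.Random(seed)
    observed = mean(similarity(f, m) for f, m in zip(females, males))
    shuffled = list(males)
    as_low = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # re-pair females with randomly ordered males
        permuted = mean(similarity(f, m) for f, m in zip(females, shuffled))
        if permuted <= observed:
            as_low += 1
    return observed, as_low / n_perm
```

A small returned fraction would indicate that observed pairs are less similar than randomly formed pairs; a fraction near 0.5 or above gives no support for a preference for dissimilar mates.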



Each of the two hypotheses of genetic incompatibility 
results in expectations of how allele inheritance in 
observed (surviving) offspring should differ from Mende- 
lian expectations. Under the hypothesis of fetal loss based 
on self-/non-self-recognition ($AI_{FL}$), observed offspring
should represent those fertilizations where the zygote dis- 
proportionately inherited paternal alleles that resulted in 
a fetal genotype dissimilar from that of the mother. 
In contrast, under the hypothesis of genetic incompatibility
based on heterozygosity ($AI_{HET}$), observed offspring
should represent those fertilizations where the zygote dis- 
proportionately inherited paternal alleles that differed 
from the alleles inherited from the mother. Although 
these two processes of genetic incompatibility have simi- 
larities, they result in different predictions for genetic sig- 
natures in surviving offspring, as depicted in Figure 2. 

Expected patterns of allele inheritance were generated 
by keeping observed mating pairs constant, and generat- 
ing simulated offspring for each based on Mendelian 
inheritance. This process was repeated 1000 times to gen- 
erate the distribution of expected allele inheritance values 
under Mendelian inheritance (i.e., without the effects of
genetic incompatibility). The observed allele inheritance 
patterns were then compared to expected values to test if 
alleles were inherited in a pattern consistent with either 
of the two hypotheses of genetic incompatibility. These
analyses were conducted using the program STORM. 
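
The fixed-pair simulation can be sketched as below. This is an assumption-laden illustration, not the STORM code: the statistic is an unweighted stand-in for the heterozygosity-based allele inheritance metric (the published metric is weighted, and its weights are not reproduced here), uninformative pedigrees are not filtered out, and all names are hypothetical.

```python
import random


def simulate_inheritance(mother, father, rng):
    """Return {locus: (maternal_allele, paternal_allele)} under Mendelian inheritance."""
    return {l: (rng.choice(mother[l]), rng.choice(father[l])) for l in mother}


def dissimilar_paternal_fraction(offspring_list):
    """Unweighted stand-in statistic: share of locus-level inheritances in which
    the paternal allele differs from the maternal allele."""
    calls = [mat != pat for child in offspring_list for mat, pat in child.values()]
    return sum(calls) / len(calls)


def expected_distribution(pairs, n_reps=1000, seed=0):
    """Keep the observed (mother, father) pairs fixed; redraw offspring n_reps times."""
    rng = random.Random(seed)
    return [
        dissimilar_paternal_fraction(
            [simulate_inheritance(m, f, rng) for m, f in pairs]
        )
        for _ in range(n_reps)
    ]


def upper_tail_p(observed, expected):
    """Fraction of simulated values as high as or higher than the observed value."""
    return sum(v >= observed for v in expected) / len(expected)
```

Because the mating pairs are held constant, any excess of the observed statistic over the simulated distribution reflects which alleles were inherited, not which individuals mated.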

Figure 2. Example pedigrees showing expectations in the genotypes
of surviving offspring under hypotheses of Mendelian inheritance,
genetic incompatibility based on self-/non-self-recognition (GI 1), and
genetic incompatibility based on heterozygosity (GI 2). Arrows
indicate either an increase or decrease relative to Mendelian
expectations. Pedigree 1 represents a scenario where both processes
result in the same expectations, and pedigree 2 represents a scenario
where the expectations are different. Again, the mechanism of GI 1
results in offspring genetically dissimilar from the mother, whereas
the mechanism of GI 2 results in heterozygous offspring, regardless of
their similarity to the mother.

To assess if any observed biases in allele inheritance
were due to genome-wide effects or due to a strong signal
from one or a few loci, each locus was tested independently
for differences between observed and expected
allele inheritance patterns, and critical alpha values to
delineate statistical significance were adjusted using a
Bonferroni correction for multiple tests (Hochberg 1988).
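
As a worked illustration of the per-locus correction, the sketch below applies a plain Bonferroni adjustment. The family-wise alpha of 0.05 is an assumption (the text does not state it), and the function names are hypothetical. With 28 loci, the per-locus threshold under that assumption is 0.05/28, or roughly 0.0018.

```python
def bonferroni_alpha(alpha, n_tests):
    """Per-test significance threshold under a plain Bonferroni correction."""
    return alpha / n_tests


def significant_loci(p_values, alpha=0.05):
    """Return the loci whose P-value falls below the corrected threshold.

    p_values: dict mapping locus name -> P-value from the per-locus test.
    alpha: assumed family-wise error rate (not stated in the text).
    """
    threshold = bonferroni_alpha(alpha, len(p_values))
    return [locus for locus, p in p_values.items() if p < threshold]
```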

To compare observed changes in heterozygosity of mi- 
crosatellite loci over time with those expected without a 
bias in fertilization/pregnancy success, the observed 
mating pairs for each year were kept constant, and one 
offspring was generated for each based on Mendelian 
inheritance. This process generated the observed number 
of offspring for each year, from the assigned parents, but 
with genetic characteristics based solely on Mendelian 
inheritance. This process was repeated 1000 times to gen- 
erate heterozygosity values expected for the offspring 
in each year of the study. The trend line for observed 
heterozygosity over time was then compared to that 
obtained with the simulated data to test if the observed 
trend is different than expected without biased fertilization/pregnancy
patterns. The P-value was estimated based
on the number of simulated data sets that had a trend-line
slope as high as, or higher than, that of the observed data.
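
The temporal comparison can be sketched as follows, under stated assumptions: the trend line is taken to be an ordinary least-squares regression of individual calf heterozygosity on birth year, heterozygosity is the unweighted fraction of heterozygous loci, and the names and data layout (a dict mapping year to a list of mother-father genotype pairs) are hypothetical.

```python
import random
from statistics import mean


def ols_slope(xs, ys):
    """Slope of the ordinary least-squares line of ys on xs."""
    x_bar, y_bar = mean(xs), mean(ys)
    num = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    den = sum((x - x_bar) ** 2 for x in xs)
    return num / den


def heterozygosity(genotype):
    """Fraction of typed loci that are heterozygous."""
    return sum(a != b for a, b in genotype.values()) / len(genotype)


def simulated_slopes(pairs_by_year, n_reps=1000, seed=0):
    """For each replicate, draw one Mendelian calf per observed pair per year
    and record the slope of heterozygosity against birth year."""
    rng = random.Random(seed)
    slopes = []
    for _ in range(n_reps):
        years, values = [], []
        for year, pairs in pairs_by_year.items():
            for mother, father in pairs:
                calf = {l: (rng.choice(mother[l]), rng.choice(father[l]))
                        for l in mother}
                years.append(year)
                values.append(heterozygosity(calf))
        slopes.append(ols_slope(years, values))
    return slopes


def slope_p_value(observed_slope, slopes):
    """Fraction of simulated slopes as high as or higher than the observed slope."""
    return sum(s >= observed_slope for s in slopes) / len(slopes)
```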

Results 

Paternity analyses were conducted on 180 identified 
mother-offspring pairs. With the exclusion method, 
paternities were assigned to 105 calves. All sampled males 
could be excluded as the father for the remaining 75 
calves, and there were no unresolved paternities (cases 
where more than one male was nonexcluded as a poten- 
tial father). Paternity assignments were similar using the 
95% criterion in CERVUS, with 112 paternities being
assigned (Table S1). There were no discrepancies in paternity
assignments between the two approaches (i.e., there
were no instances where a calf was assigned one male as a 
father with one approach, and a different male as a father 
using the other). Moreover, the results of all subsequent 
analyses were the same regardless of which paternity 
assignment method was used. Therefore, for clarity,
only those results obtained using the 105 mother-offspring-father
triads assigned using the exclusion approach
are reported here. Analysis of the MHC was only con- 
ducted on triads identified through 2004, which represent 
78 of these triads. Thus, analyses of the MHC data are 
based on this slightly smaller data set. 

As observed with the RFLP data, levels of genetic variability
at microsatellite loci in offspring were significantly
higher than expected given this gene pool, with only 10
of the 1000 simulated data sets having an average HL
value as low or lower than that observed (observed mean
HL = 0.549, 95% CI of expected means = 0.553-0.585,
effect size = 0.326, P < 0.011) (Fig. 3A), and 18 having
an average IR value as low or lower than that observed
(observed mean IR = -0.0272, 95% CI of expected
means = -0.0250 to 0.0412, effect size = 0.318, P < 0.019).
This excess of heterozygous offspring does not appear to
result from precopulatory mate choice for genetically
dissimilar mates because the relatedness of mating pairs
was not significantly lower than expected values if mate
choice is random with respect to relatedness (observed
mean r = -0.0202, 95% CI of expected means = -0.200 to
0.0749, P < 0.600) (Fig. 3B).

Figure 3. Observed and expected patterns of genetic characteristics
based on microsatellites. In all cases, histograms represent the
distribution of expected values and the dashed lines represent the
observed values. (A) Observed and expected levels of genetic
variability (HL) in the 105 offspring with both parents assigned. For
the observed data, the mean and standard deviation for HL were
0.55 and 0.085, respectively. Note that HL is a measure of
homozygosity, thus lower HL values correspond to higher
heterozygosity. (B) Observed mating pair relatedness (MPR) and
expected values if mate choice is random with respect to relatedness
based on the 105 identified mating pairs, which consisted of 45
different males and 50 different females. The average relatedness
between observed mating pairs was -0.02, with a standard deviation
of 0.56. (C) Observed and expected allele inheritance (AI) patterns,
where AI is a weighted metric quantifying the frequency at which a
paternal allele is inherited that is different than that inherited from
the mother (Frasier 2008).

The analyses did not detect a signature of allele inheri- 
tance as expected under the hypothesis of genetic incom- 
patibility based on self-/non-self-recognition (observed 
mean $AI_{FL}$ = 1.27, 95% CI of expected means = 1.15-
1.38, P < 0.743). However, offspring did inherit paternal
alleles different than those inherited from the mother
more often than expected (observed mean $AI_{HET}$ = 1.40,
95% CI of expected means = 1.16-1.37, effect size =
0.367, P < 0.003), as predicted by the hypothesis of
genetic incompatibility based on heterozygosity (Figs. 3C, 
4). No single locus showed a significant signal of this pat- 
tern, but all showed a tendency toward the biased inheri- 
tance of a divergent paternal allele. This result suggests 
that the observed patterns are due to selection of diver- 
gent paternal alleles at several loci throughout the genome 
(i.e., a genome-wide effect), as opposed to being the 
result of particularly strong selection at a single locus. 

At the MHC, offspring did not have significantly higher 
heterozygosity than expected given the gene pool (IR, 
P < 0.518; HL, P < 0.394) (Table S2). There was not a 
signal for preference for MHC dissimilar mates 
(P < 0.685), or for either biased pattern of allele inheritance
(Table S2). However, previous studies in humans
only found evidence for biased allele inheritance in pedi- 
grees where the mother was heterozygous and the father 
was homozygous (Ober et al. 1998). When the right 
whale data were evaluated in this way, there was indeed a 
trend for offspring to inherit paternal alleles dissimilar to 
those inherited from the mother (Figure S1). However,
the number of MHC-typed pedigrees that fit this scenario 
was low (n = 8), and therefore these analyses will be 
more informative in the future as more such pedigrees 
become available. 

Heterozygosity at microsatellite loci of calves born 
throughout the study period showed a slight increasing 
trend (Fig. 5). This trend is significantly greater than 
would be expected if there was no selection for genetically 
dissimilar gametes, which would result in an expected 
slight decrease in heterozygosity over the same time 
period (observed slope = 0.00242, 95% CI of expected
slopes = -0.00125 to 0.000763, P < 0.05) (Fig. 5). It is
noteworthy that such a temporal trend was found given 
that the length of the study period is short relative to the 
generation time and lifespan in this species. The implica- 
tion is that the true trend could be much greater than 
that detected, but would require a much longer study 
period to be adequately quantified. 

Discussion 

Our results show that microsatellite heterozygosity is higher
in North Atlantic right whale offspring than is expected
from the gene pool, which appears to result from fertilizations
and/or pregnancies being more successful between
genetically dissimilar gametes. Moreover, this process has
resulted in heterozygosity actually increasing in calves born
throughout the 25-year study period, as opposed to the
decrease expected if fertilization patterns were random.

Figure 4. Informative pedigree structures for analyses of biased allele inheritance patterns toward heterozygous offspring for the microsatellite
data. Letters do not represent actual alleles, but rather are used to designate whether or not alleles are shared. For example, there were 373
mother-offspring-father triads where the mother was homozygous and the father was heterozygous, sharing one allele with the mother. "NI"
indicates triad configurations that were not informative due to the offspring inheriting a maternal allele that was not present in the father, and
therefore would be heterozygous regardless of which allele was inherited from the father (i.e., there was not an opportunity for biased allele
inheritance). The number of pedigrees here exceeds the number of calves with paternities assigned because each locus for a mother-father-offspring triad
can represent a different pedigree structure, with respect to allele inheritance. Thus, each triad at each locus represents one assessment of allele
inheritance, for a total of (105 mother-father-offspring triads) x (28 microsatellite loci) = 2940 triad-by-locus pedigrees. The number of
pedigrees shown here is less than 2940 because many pedigrees were uninformative (e.g., the mother and father were homozygous, and
therefore there was not the opportunity for biased allele inheritance).

Small populations are expected to lose genetic diversity 
over time through genetic drift and inbreeding: these 
principles form the basis of conservation genetics (Frank- 
ham et al. 2002; Allendorf and Luikart 2006). Indeed, this 
expected loss of genetic variability, and the subsequent 
decline in fitness, has been used to provide guidelines as 
to when populations have become too small to warrant 
conservation efforts due to this inevitable genetic decline 
(e.g., Franklin 1980). However, our results show that this 
process (genetic incompatibility based on heterozygosity) 
provides a natural means through which the loss of 
genetic diversity can be minimized in small populations. 
This important long-term implication has received sur- 
prisingly little attention. 

Findings of similar biases in fertilization/pregnancy pat- 
terns are widespread (e.g., Olsson and Shine 1997; Birkhead 
et al. 2004). However, most studies focus on short-term 
implications, such as identifying whether or not a specific 
mechanism is acting, or how such mechanisms influence 
mating systems and patterns. One notable exception is a 
long-term study of a small population of Scandinavian 
wolves (Canis lupus). In this population, reproductive suc- 
cess is limited to individuals that are particularly heterozy- 
gous, which has the long-term effect of maintaining 
heterozygosity over generations despite an increase in the 
amount of inbreeding (Bensch et al. 2006). Combining the 
study of Bensch et al. (2006) and the results presented here 
shows that there is increasing evidence that the genetics of 
small populations is more complex than previously 
thought. Specifically, there are a number of widely docu- 
mented natural mechanisms - such as mate choice for 
genetically dissimilar mates, or fertilization bias toward dis- 
similar gametes - that can have an increasing impact on 
overall patterns of genetic diversity as population size 
declines, that can serve to minimize the loss of genetic 
diversity, and the associated negative consequences, 
through time. 

Although similar biases in fertilization/pregnancy suc- 
cess have been found in several other species, a mecha- 
nism for this process has not yet been identified. One 
primary question is whether the observed patterns are 
due to biased fertilization patterns (e.g., "cryptic female 
choice" for dissimilar sperm) or biased mortality of 
zygotes (Olsson et al. 1999). If it results from biased mortality
of zygotes, then there should be some impact on
female reproductive success, because females that lose
zygotes later in pregnancy may not be able to become
pregnant again during that breeding season (i.e., the later
the fetus is aborted, the lower the probability of becoming
pregnant again that season). It is also possible that some
of this pattern could be due to differential mortality of
homozygous newborn calves. Thus, it could result from
processes anywhere along a continuum from differential
fertilization success by dissimilar gametes, through differential
mortality of homozygous zygotes throughout development,
to the differential loss of homozygous newborns.
Further work is needed to differentiate between these
alternatives.

Figure 5. Observed (points and solid trend line) and expected
(dashed line) trends in multilocus heterozygosity of calves born
throughout the study period. This analysis was based on data from
the 105 calves with both parents assigned, so that simulations could
be conducted using the identified parents and Mendelian inheritance
to generate the expected changes in heterozygosity over time if
mating and fertilization patterns were random.

There is substantial evidence of fetal loss in North 
Atlantic right whales (Browning et al. 2010). Briefly, dur- 
ing the winter most pregnant females migrate to calving 
grounds off the coastline of Georgia and Florida. How- 
ever, each year a few females who are due to reproduce 
migrate to the calving grounds but do not give birth. This 
suggests that they may have been pregnant at one time
(which triggered them to migrate to the calving grounds)
but that either the fetus was lost, or the calf was lost
before it was detected. Using data such as these, Browning 
et al. (2010) estimate that the rate of fetal loss could be 
as high as 3 per year, which is substantial given that dur- 
ing this same time period the average number of calves 
born (for the entire species) was 14 per year. Thus, it is 
possible that decreased survival of zygotes from geneti- 
cally similar gametes is at least partially responsible for 
these unsuccessful pregnancies. However, several other 
potential factors, such as infectious diseases and toxins, 
may also be influencing the rate of fetal loss (e.g., Douc- 
ette et al. 2012), and further studies are needed to eluci- 
date the relative roles of these different factors. 

The mating system of right whales represents perhaps 
the most extreme example of sperm competition in 
mammals (e.g., Frasier et al. 2007a), with males having 
the largest testis size-to-body size ratio of any mammalian
species (Brownell and Ralls 1986; Frasier et al.
2007a). Sperm competition results in sperm from multi- 
ple males simultaneously coexisting in the female repro- 
ductive tract, which, in addition to providing an 
environment for competition between sperm, is also an 
ideal environment for females to exercise preferences for 
sperm with certain traits (e.g., Olsson et al. 1996). 
Although the mechanism(s) through which females may 
be able to select the genotype of sperm are currently 
unknown, such mechanisms do clearly exist in some spe- 
cies (e.g., Carre and Sardet 1984), and likely involve genes 
expressed on the surface of sperm (e.g., Martin-Villa et al.
1999; Dorus et al. 2010). 

From an evolutionary perspective, the cryptic female 
choice of dissimilar sperm would be less costly to right 
whale females than the mechanism of differential mortal- 
ity of zygotes. Because the breeding season for this species 
is limited, females losing zygotes that are homozygous 
would reduce the likelihood of those females becoming 
pregnant that year. Thus, strong selection pressures would 
exist for earlier detection of which sperm would produce 
viable offspring. However, with the current data it is not 
possible to differentiate between these two potential 
mechanisms, and other data (such as the high estimated 
rate of fetal loss, and intense sperm competition) suggest 
that both mechanisms are feasible in this species. 

Lastly, the question remains of how either of the 
hypothesized mechanisms could maintain genome-wide 
heterozygosity. Although mechanisms have been identified 
for maintaining heterozygosity at the specific loci 
expressed on the surface of sperm and/or at the maternal-fetal
interface (Ober et al. 1998; Martin-Villa et al.
1999; Parham 2005), it is not immediately clear how they 
could result in the genome-wide patterns that we 
observed. The most likely explanation of this is what is 
often referred to as "associated overdominance," where a 
series of genes throughout the genome are driving pat- 
terns of gametic incompatibility, and the presumably neu- 
tral markers examined show similar patterns due to 
linkage disequilibrium with these genes (Bierne et al. 
2000; Hansson and Westerberg 2002). The theoretical 
rationale for this idea is that linkage disequilibrium (or, 
more appropriately, gametic disequilibrium) can develop 
throughout the genome in recently bottlenecked popula- 
tions, resulting in genome-wide patterns of linkage or 
extremely large regions of linkage (e.g., Allendorf and
Luikart 2006). For example, Reich et al. (2001) found that 
linkage extended to greater than 100 kb for some loci in 
the U.S. human population, with this large range result- 
ing as a side effect of a recent bottleneck. Given the his- 
tory of right whale exploitation, and its current small 
population size, it seems likely that genome-wide patterns
of gametic disequilibrium could have developed, resulting
in the genome-wide patterns that we observed. 
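
For readers less familiar with the term, gametic disequilibrium between two biallelic loci is conventionally measured as D = p_AB - p_A * p_B, and is often standardized as r^2. The following is a minimal Python sketch of that standard definition. The function name and the haplotype counts are hypothetical, chosen only for illustration; they are not right whale data and are not part of the analyses reported here.

```python
def gametic_disequilibrium(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, r2) for two biallelic loci from haplotype (gamete) counts.

    D  = p_AB - p_A * p_B
    r2 = D^2 / (p_A * (1 - p_A) * p_B * (1 - p_B))
    """
    total = n_AB + n_Ab + n_aB + n_ab
    if total == 0:
        raise ValueError("no haplotypes supplied")
    p_AB = n_AB / total
    p_A = (n_AB + n_Ab) / total
    p_B = (n_AB + n_aB) / total
    D = p_AB - p_A * p_B
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    # r2 is undefined when either locus is fixed for one allele
    r2 = D * D / denom if denom > 0 else float("nan")
    return D, r2


# Hypothetical counts: alleles associate at random (no disequilibrium)
D_indep, r2_indep = gametic_disequilibrium(25, 25, 25, 25)

# Hypothetical counts: allele A is always found with allele B
D_linked, r2_linked = gametic_disequilibrium(50, 0, 0, 50)
```

When r^2 between a neutral marker and a gene under selection is high, heterozygosity at the marker tends to track heterozygosity at the gene, which is the logic behind the associated overdominance explanation.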

Although the results for the MHC were not statistically
significant, a clear trend was detected in the
direction expected under the hypothesis of biased allele
inheritance patterns based on heterozygosity. Continuing 
these studies, as more pedigrees become available, will 
provide more adequate sample sizes for such analyses. It 
is interesting to note that heterozygosity was substantially 
higher for the MHC than for the microsatellite loci (e.g., 
HL = 0.397 vs. 0.550, respectively). It is widely docu- 
mented that pathogens provide strong selection pressures 
on the MHC, often in ways that lead to heterozygote 
advantage, resulting in the ability to recognize and 
respond to a wider range of pathogens (e.g., Hedrick 
1994). Thus, the already high heterozygosity at the MHC, 
combined with the range of selection pressures acting 
on it, may mask the signal of biased allele inheritance 
patterns at this locus. Indeed, the high heterozygosity in 
parents resulted in only a few pedigrees meeting the crite- 
ria where biased allele inheritance would be expected. 
Although the MHC is often used as a marker of choice 
for studies of differential mating and fertilization patterns 
(e.g., Penn and Potts 1999; Bos et al. 2009), our analyses 
clearly show that several other regions throughout the 
genome also likely play an important role. 

It has recently been recognized that paternity assign- 
ment methods (based on both exclusion and likelihood) 
can result in biased paternity assignment toward males
that are more or less heterozygous, and more or less
related to the mother, depending on the allele frequency
distributions (Wetzel and Westneat 2009; Wang 2010),
which can lead to biases in subsequent analyses
regarding mate preferences. The suggested solution is to 
use one subset of markers for paternity assignment and 
another subset for any subsequent analyses (Wetzel and 
Westneat 2009; Wang 2010). The low levels of genetic 
variation in North Atlantic right whales prevented us 
from using this approach because each locus is needed to 
obtain high resolution in the paternity analyses. However, 
several lines of evidence indicate that our results are not 
an artifact of such biases. 

First, the only analysis that we conducted that could
have been affected by such a bias (other than paternity
itself) was our analysis of mate choice, where we tested
whether mates are particularly genetically dissimilar. If our
exclusion-based paternity assignment was biased in the 
manner suggested, mothers and assigned fathers would be 
falsely inferred to be related ("...mothers and nonexclud- 
ed males are falsely inferred to be related when the 
same marker is used for both paternity and relatedness 
analyses, and the conclusion is true no matter which 
relatedness estimator is used and how allele frequencies 
are distributed." [Wang 2010, p. 1903]). We found no
such pattern (Fig. 3B), and the relatedness of assigned 
fathers to mothers had an average of ~0. More impor- 
tantly, the simulations used to test this hypothesis showed 
that assigned fathers were no more/less related to the 
mother than randomized males in the population. These 
data show that paternity assignments are not biased in 
the manner proposed. 
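
The logic of this randomization test can be sketched as follows. This is a simplified illustration, not the code used in this study: the function name, the data layout (a dictionary of pairwise relatedness estimates), and the toy values are all hypothetical, and a single pool of candidate males is assumed rather than a pool that changes by year.

```python
import random


def mate_relatedness_test(pairs, candidates, relatedness, n_iter=10000, seed=0):
    """Compare mean mother-father relatedness with a random-mating null.

    pairs       : list of (mother_id, assigned_father_id)
    candidates  : list of candidate male ids (assumed: one shared pool)
    relatedness : dict mapping (mother_id, male_id) -> relatedness estimate
    Returns (observed mean, mean of null distribution, two-sided p-value).
    """
    rng = random.Random(seed)
    observed = sum(relatedness[(m, f)] for m, f in pairs) / len(pairs)

    null_means = []
    for _ in range(n_iter):
        # Keep each mother, but replace her mate with a randomly drawn male
        total = sum(relatedness[(m, rng.choice(candidates))] for m, _f in pairs)
        null_means.append(total / len(pairs))

    null_mean = sum(null_means) / n_iter
    # Two-sided: how often is a randomized mean at least as far from the
    # center of the null distribution as the observed mean?
    extreme = sum(
        abs(x - null_mean) >= abs(observed - null_mean) for x in null_means
    )
    p_value = (extreme + 1) / (n_iter + 1)
    return observed, null_mean, p_value


# Hypothetical toy data (not right whale estimates)
toy_relatedness = {
    ("F1", "M1"): 0.00, ("F1", "M2"): 0.25, ("F1", "M3"): -0.10,
    ("F2", "M1"): 0.05, ("F2", "M2"): 0.00, ("F2", "M3"): 0.50,
}
toy_pairs = [("F1", "M1"), ("F2", "M2")]
toy_males = ["M1", "M2", "M3"]
obs, null_center, p = mate_relatedness_test(
    toy_pairs, toy_males, toy_relatedness, n_iter=2000
)
```

A large p-value in such a test indicates that assigned fathers are neither more nor less related to the mothers than randomly drawn males.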

Second, Wang (2010) showed that exclusion and likeli- 
hood paternity assignment methods can lead to opposite 
biases. For example, when comparing data sets of the 
same number of loci and allele distributions, exclusion 
methods tended to result in biased assignment of pater- 
nity toward males who appear more related to the moth- 
ers, whereas likelihood paternity assignment results in 
biased paternity assignment toward males who appear less 
related to mothers. We used both likelihood and exclu- 
sion paternity assignment methods, and there were no 
discrepancies in assignments between the two, suggesting 
that our assignments are not the result of such artifacts. 
Moreover, we observed the same results across all subse- 
quent analyses (i.e., of mate choice and allele inheritance) 
regardless of which paternity assignments were used. 
These data indicate that our subsequent analyses are not 
the result of biases resulting from paternity assignment 
methods. 

Third, our subsequent analyses were based on assessing 
allele inheritance patterns given the assigned parents. These 
analyses were based on comparing the observed dissimi- 
larity of maternally and paternally inherited alleles to 
expected values generated by simulating offspring under 
Mendelian inheritance from the assigned parents. This 
approach means that, even if paternity was biased toward 
heterozygous males, it would not artificially inflate such 
an analysis. For example, if paternity was biased toward 
heterozygous males, it would be more likely for offspring 
to inherit differing maternal and paternal alleles, which 
would artificially inflate our "expected" values of allele 
inheritance to which we compared the observed data 
(which would also be similarly biased). Thus, because all 
analyses are based on assigned parents, they are essentially 
"standardized," where the observed and expected data are 
calculated from the same paternity assignments, and 
would therefore be similarly affected, which serves to 
eliminate any potential artifacts that could result from 
biased paternity assignment. 
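
The "standardized" comparison described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the code used in this study: the function names, the representation of genotypes as tuples of allele labels, and the toy trio are hypothetical, and allele dissimilarity is simplified here to whether the maternally and paternally inherited alleles differ.

```python
import random


def simulate_offspring_het(mother, father, rng):
    """Simulate one offspring under Mendelian inheritance.

    mother, father: lists of (allele1, allele2) tuples, one per locus.
    Returns the proportion of loci at which the simulated offspring
    receives different alleles from its two parents.
    """
    het = 0
    for m_geno, f_geno in zip(mother, father):
        # Each parent transmits one of its two alleles with equal probability
        if rng.choice(m_geno) != rng.choice(f_geno):
            het += 1
    return het / len(mother)


def expected_het_distribution(trios, n_iter=5000, seed=1):
    """Null distribution of mean offspring heterozygosity across all trios."""
    rng = random.Random(seed)
    means = []
    for _ in range(n_iter):
        vals = [simulate_offspring_het(m, f, rng) for m, f, _calf in trios]
        means.append(sum(vals) / len(vals))
    return means


def observed_het(trios):
    """Mean observed heterozygosity of the real calves in the trios."""
    vals = [sum(a != b for a, b in calf) / len(calf) for _m, _f, calf in trios]
    return sum(vals) / len(vals)


# Hypothetical toy trio: (mother, father, calf), two loci each
toy_trios = [
    ([(1, 2), (3, 3)], [(1, 2), (3, 4)], [(1, 2), (3, 4)]),
]
null = expected_het_distribution(toy_trios)
obs_het = observed_het(toy_trios)
# One-sided p-value: are calves more heterozygous than Mendelian expectation?
p_het = (sum(x >= obs_het for x in null) + 1) / (len(null) + 1)
```

Because the simulated offspring are drawn from the same assigned parents as the observed calves, any bias in which males were assigned shifts the observed and expected values together.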

Lastly, if our primary finding that offspring are more 
heterozygous than expected from this gene pool is an arti- 
fact of biased paternity assignments, then calves with 
fathers assigned should be more heterozygous than calves 
without fathers assigned. Figure 6 shows the data comparing
heterozygosity values between calves with and
without paternity assigned, and it can clearly be seen that
the means and ranges of the two distributions are very
similar (means are 0.315 vs. 0.299), although a larger proportion
of calves with fathers assigned have lower heterozygosity
values. As expected from this figure, there is not
a statistically significant difference between the means of
these two groups (t-test, t = 1.57, df = 148, P = 0.12).

Figure 6. Heterozygosity values for calves with (N = 105) and
without (N = 72) paternity assigned. The dark horizontal lines indicate
the mean, the boxes encapsulate the middle 50% range (values in
the second and third quartiles), and the whiskers delimit 1.5 times the
interquartile range, which is roughly 2 SD.
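
A two-sample comparison of this kind can be sketched as below. The text does not state which form of the t-test was used; the reported df of 148 is lower than the pooled value of 105 + 72 - 2 = 175, which would be consistent with an unequal-variance (Welch) test, so that form is assumed here. The function name and the example values are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(x, y):
    """Welch's unequal-variance t statistic and its approximate df.

    Uses the Welch-Satterthwaite approximation for the degrees of freedom.
    """
    n1, n2 = len(x), len(y)
    v1 = variance(x) / n1  # squared standard error of the first mean
    v2 = variance(y) / n2  # squared standard error of the second mean
    t = (mean(x) - mean(y)) / sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df


# Hypothetical heterozygosity values (not the study data)
with_father = [0.31, 0.28, 0.35, 0.30, 0.33]
without_father = [0.29, 0.32, 0.27, 0.31]
t_stat, t_df = welch_t(with_father, without_father)
```

The resulting statistic would then be compared against a t distribution with the computed degrees of freedom to obtain a P value.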

Combined, these data show that the results and inter- 
pretations we have made are not an artifact of biases in 
paternity assignment methods. 
Supporting Information

Additional Supporting Information may be found in the 
online version of this article: 

Figure S1. Informative pedigrees for MHC data. This
deviation from expected values is expected if reproduc- 
tion is biased towards more heterozygous offspring. How- 
ever, note the small number of MHC pedigrees that had 
this structure. 

Table S1. Paternities assigned, by year, using the methods
of exclusion and the 95% criterion of CERVUS. In both
cases, paternity is assessed independently for each year to 
account for a changing pool of candidate males over time, 
as described in Frasier et al. (2007a). 
Table S2. Observed mean values and 95% confidence 
intervals of expected values for metrics associated with 
analyses of the MHC data. Metrics include internal relat- 
edness of calves (IR), heterozygosity-by-loci of calves 
(HL), mating pair relatedness (MPR), allele inheritance 
under the fetal loss scenario (AI_FL), and allele inheritance
under the heterozygosity scenario (AI_Het).